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Abstract 

We apply a genetic programming technique to search for the doubly Cabibbo sup- 
pressed decays A+ pK^TT~ and K~^K^tt~ . We normalize these decays to 
their Cabibbo favored partners and find BR(A+ pK+7r^)/BR(A+ pK^ir^) = 
(0.05 ± 0.26 ± 0.02)% and BR(D+ ^ K+K+7r-)/BR(D+ ^ K-K+tt+) = (0.52 ± 
0.17 lb 0.11)% where the first errors are statistical and the second are systematic. 
Expressed as 90% confidence levels (CL), we find < 0.46% and < 0.78% respectively. 
This is the first successful use of genetic programming in a high energy physics data 
analysis. 

Key words: Genetic Programming 
PACS: 13.25.Ft, 13.30.Eg 



Cabibbo suppressed (CS) and doubly Cabibbo suppressed (DCS) decays are 
important in helping us understand the dynamics of hadronic decay pro- 
cesses. DCS decays are unique to the charmed hadrons; charm is the only 
heavy up-type quark that hadronizcs. DCS decay rates are such that only 
DCS decays of D"*" and D° have been observed, while CS decays of nearly 

^ See http://www-focus.fnal.gov/authors.html for additional author information. 
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all the charmed hadrons have been observed. This paper presents a search 
for DCS decays of A+ and . Both branching ratios are expected to be 
small. Naive expectations place DCS branching ratios around tan"^ 6c, or 
about 0.25%, relative to their Cabibbo favored (CF) counterparts. Lipkin ar- 
gues [1] that exact SU(3) symmetry would require the product of the DCS 
relative branching ratios BR(D"'" K^7r^7r+)/BR(D^ K'tt'^tt'^) and 
BR(D+ K+K+7r-)/BR(D+ ^ K^K+tt-) to be exactly tan^ 9^. This means 
the latter should be about 0.07%; a much larger value requires a large violation 
of flavor SU(3). In the A+ case, the CF normalizing mode has a c-d W"*" ex- 
change decay channel available, while the DCS decay mode may only proceed 
through spectator decays. The lifetime difference between and shows 
us that this exchange mode is important, so we expect that the branching 
ratio for A+ — > pK^TT~ should also be less than tan"^ 9c. 

We have apphed a genetic programming (CP) [2] technique to search for the 
DCS decays Df — ^ K~^K~^n~ and A+ — pK^7i~ (charge-conjugate states 
are implied), neither of which have been observed. CP is a machine learning 
technique which evolves populations of programs (event filters in our case) 
over a series of generations. The genetic programming learning mechanism is 
modeled on biological and evolutionary principals and differs from some other 
machine learning solutions in that the form of the solution is not specified in 
advance but is determined by the complexity of the problem. A full demon- 
stration of this technique on the observed DCS decay — > K~^7r~^7r~ is given 
in Reference 3. 

These results use data taken with the charm photoproduction experiment FO- 
CUS (FNAL-E831), an upgraded version of FNAL-E687 [4] which collected 
data using the Wideband photon beamline during the 1996-1997 Fermilab 
fixed-target run. The FOCUS experiment utilizes a forward multiparticle spec- 
trometer to study charmed particles produced by the interaction of high energy 
photons {{E) ^ 180 GeV) [5] with a segmented BeO target. Charged parti- 
cles are tracked within the spectrometer by two silicon microvertex detector 
systems. One system is interleaved with the target segments [6]; the other 
is downstream of the target region. These detectors provide excellent sep- 
aration of the production and decay vertices. Further downstream, charged 
particles are tracked and momentum analyzed with a system of five multi- 
wire proportional chambers [7] and two dipole magnets of opposite polarity. 
Three multicell threshold Cerenkov detectors are used to identify electrons, 
pions, kaons, and protons. FOCUS also contains a complement of hadronic 
and electromagnetic calorimeters and muon detectors. 

We use loose analysis cuts on both DCS and CF decay modes to select initial 
samples of events for optimization by CP. The FOCUS Cerenkov algorithm [8] 
returns negative 2 x log- likelihood values Wi{j) for particle j and hypothesis 
i & e,TT, K,p. Differences between log-likelihoods are used as particle ID, such 
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Fig. 1. Initial A+ (left) and (right) data samples. The upper distributions show 
the CF decay candidates, the lower distributions show the DCS candidates. In the 
plot, the D'^ K^TT^ir^ rejection cut described in the text is not applied and 
a contribution to the fit for this reflection is visible to the right of the Df peak. 

as AWxpip) = ^k{p) — Wp{p) for "proton favored over kaon." We require 
AWj^xiK) > 2 for all kaons in both decay modes. For protons from candi- 
dates, we require AW^p(p) > 4 and AWxpip) > in the initial selection. For 
the A^, we also require that the separation between the production and decay 
vertices, i, is greater than 3 times its error, ae. For the Dj", the vertex separa- 
tion requirement is i/ai > 6. For both and , the three decay fragments 
must form a vertex with a confidence level (CL) > 1%, and a production ver- 
tex is formed by adding as many remaining tracks to the charm candidate as 
possible while maintaining a vertex CL > 1%. One additional requirement is 
placed on the CF (DCS) Dj" candidates: the K^K^tt^ (K^K^tt^) combina- 



tion is rejected if, reconstructed as K 



TX ' TT 



[K^Ti^Ti ), the mass is within 2(j 



of the nominal D mass. This cut removes a prominent reflection from the CF 
candidates and stabilizes the many fits done during the optimization process; 
it is applied to the DCS mode for consistency. The initial samples of and 
candidates in CF and DCS decay modes are shown in Fig. 1. 

A CP framework (GPF) evolves and tests event filters. For each filter, we 
define a fitness 



r BdCS 

f oc — ^ X 

^CF 



1 + 0.005 X N, 



nodes J 



where -Bdcs is the number of background events found in a fit of the DCS 
mass distribution which excludes the signal region, and Sqf is the fitted CF 
yield. Scf / V Bbcs is proportional to the projected DCS significance assuming 
no real DCS events and equal CF and DCS selection efficiencies; squaring 
this quantity further emphasizes "better" filters and inverting it allows small 
fitnesses to describe good event filters. A^nodes is the total number of variables, 
constants, functions, and operators used in the filter and is included as a 
penalty term to encourage smaller filters and to attempt to eliminate the 
addition of nodes which do not select events based on physics. 
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For the data samples in Fig. 1, half the events (as explained later) along with 
a large number of variables (37 for A+, 34 for D^), operators and functions 
(21), and constants are used as inputs to the GPF which randomly generates 
a large number of filters and calculates the fitness of each. The GPF prefer- 
entially selects filters for which this fitness is small to participate in breeding 
subsequent generations of filters. In this way subsequent generations develop 
filters with better average fitnesses. At the end of the process we use the filter 
with the single best fitness to select events for further analysis. 

The variables and resulting filter used in the CF and DCS decays are identical. 
All variables commonly used in FOCUS analyses and some additional variables 
are allowed to be used in the event filter. These can be roughly broken into 
categories of vertexing, track quality, particle identification, production and 
decay kinematics, away-side charm tagging and, for the A+, evidence for decays 
of the excited states E^*)°'++. A description of the variables^ used, examples 
of the event filters constructed, and how the population of filters evolves over 
many generations can be found in Reference 3. In both cases we use 20 sub- 
populations of 1500 event filters per generation as described in Reference 3. 

When searching for A+ pK^n~ and K^^K'^n' decays (with the 

signal regions masked), the GPF is allowed to run for 80 generations. The 
process is terminated when no improvement in fitness is observed for about 
10 generations. The best A+ — pK'^7r~ filter found has 45 nodes and uses 
12 unique physics variables. The events selected are shown in Fig. 2. One 
can see that about 15% of the signal is retained compared to Fig. 1 while 
the backgrounds are reduced by a factor of ~1000. The distributions in both 
the CF and DCS cases are fit with a second degree polynomial^ and a single 
Gaussian. In the DCS case, the Gaussian mean and a are fixed to the CF values 
and we find 1.2 ± 6.6 events. Correcting for the relative efficiency ecF/^DCS — 
1.204±0.007 (stat.) calculated with Monte Carlo (MC) simulations, we obtain 
a relative BR of 

BR(A+ ^ pK+TT-) , 

gA^^-^^= (0.05 ±0.26)%. (2) 
which is consistent with zero. 

The best K^K'^n^ filter found has 85 nodes and uses 15 unique physics 

variables. The events selected are shown in Fig. 3. The fits shown are performed 



^ In addition to the variables described in Reference 3, we add three additional 
variables: AWKTri'^)^ the number of tracks in the production vertex, and a value 
indicating if any of the vertex detector track segments are shared between two 
tracks. 

^ No significant reflections in A+ pK^TT~ or — > K+K+vr" are seen in high- 
statistics MC studies (which include all known cc decay processes) of these decays, 
so we are justified in using simple background shapes. 
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Fig. 2. samples after selection. On the left is the CF normalizing mode, on the 
right, the remaining pK^ir^ candidates. 




Fig. 3. samples after selection. On the left is the CF normalizing mode, on the 
right, the remaining Dj" — > K'^K^tt" candidates. 

identically to the case except that an additional Gaussian is added to the 
CF distribution for the CS decay D+ K-K+ti+. We find 27.5 ± 9.2 events 
in the DCS distribution which, corrected by the relative efficiency ecr/eDcs = 
1.154 ± 0.005, gives a relative BR of 



In both cases our central values are calculated assuming non-resonant decays 
for the DCS case and the best known resonance models for the CF decays (the 
PDG [9] model for A+ and a FOCUS model for D'^) as explained below. 

To convert these relative BRs into upper limits including systematic errors, 
we use a method proposed by Convery [10] for incorporating systemic uncer- 
tainties on reconstruction efficiencies into BR measurements when a fit, rather 
than event counting, is used. In this case, the probability P{B) of the true BR 
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being B is given by 



p{B) oc , == exp 



B2 52 
2^ 2^ 



iB-B) 



(4) 



where B is the fitted BR, as is its error and as/S is the percent systematic 
error on the efficiency. P{B) is numerically integrated until the point at which 
90% of the physical (BR > 0) area is included. This point is reported as the 
90% confidence level. If as/S 10%, this distribution has a long high-end 
tail, raising the 90% limit considerably. 

We consider four sources of systematic error on our knowledge of the relative 
efficiencies of the CF and DCS decay modes. First, and negligible, is the num- 
ber of MC events used. Second and third, we consider the effects of different 
resonance models for the DCS and CF states respectively. Finally, we consider 
whether the evolved event selector may have different efficiencies for the CF 
and DCS modes. 

In studying possible resonances for pK'^7r~ candidates, we calculate 

efficiencies as if the final state is entirely non-resonant or entirely — >■ 
A(1232)*^K^ or A+ — * pK*(892)°. The systematic error is taken as the stan- 
dard deviation of the three possible efficiencies. For the — > K~^K~^'k~ 
candidates, we consider non-resonant decays, — > K*{892)^K'^, and — > 
Ko(1430)°K+ in the same way. From these studies we find 5.3% and 10.7% 
systematic uncertainties on the A+ and efficiencies respectively. 

The resonant structures of the CF decays are reasonably well known. For 
the A+, we use two models, one from the PDG and another which excludes 
the A(1520)''7r"'" decay mode. For the D^, we consider an incoherent model 
based on the PDG averages and a coherent model [11] developed from the 
FOCUS data. From these studies we find 2.1% and 2.6% uncertainties on the 
A+ — > pK~Tr'^ and Df K'K'^tt^ efficiencies, respectively. 

Our final systematic contribution is motivated by the concern that the effi- 
ciency of the final GP-generated event filter may differ for the CF and DCS 
modes (after correction for kinematic acceptance of different final states) in 
a way that is not well modeled by MC. Since this is impossible to measure, 
we adopt a more rigorous test. We test if the event filter has the same effi- 
ciency on CF data and MC events. We do this by comparing the CF yields 
of data and MC samples before and after the event filter is applied. ^ For 
the A+, we find that the event filter retains 14.5 ± 0.4% and 14.9 ± 0.1% of 
the data and MC events, respectively. For the D^, we determine these quan- 
tities to be 21.0 ± 0.4% and 20.3 ± 0.1%. We take the differences between 



^ The K ir'^Tr'^ rejection cut is applied with the event filter in the case. 
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Table 1 

Summary of systematic uncertainties. Listed are the percent uncertainties on the 
relative efHciencies of the DCS and CF decay modes from various sources. 





Syst. Unc. (%) 


Source 


A+ 


Dt 


MC statistics 


0.6 


0.4 


DCS resonances 


5.3 


10.7 


CF resonances 


2.1 


2.6 


CP filter 


2.6 


3.5 


Total 


6.3 


11.6 



these numbers (neglecting the errors) as systematic uncertainties; these cause 
2.6% and 3.5% uncertainties on the relative efficiencies for A+ — > pK'^n^ 
and — > K'^K'^n' respectively. All systematic uncertainties on the relative 
efficiencies are summarized in Table 1. 

Finally, as mentioned above, we only use half (even-numbered) of the events 
in the optimization of the event filter. The final values use the event filter 
applied to all the events, but as a check, wc divide the sample into events the 
GPF used and did not use. We measure the BR independently for these two 
samples and see no significant evidence for a difference, strongly suggesting 
that the GPF is not arbitrarily selecting or rejecting small numbers of events 
to artificially reduce backgrounds or enhance signals. 

Using the total percent errors in Table 1 as as/S and the above BRs, statis- 
tical errors, and percent systematic errors, we integrate P{B) from Eq. 4 as 
described and find 

BR(A?^pK-.-)_^^^^^„^^ (5) 



BR(A+ ^ pK-n+) 
and 

BR(D+ ^ K+K+TT-) 



< 0.78% (6) 



BR(D+ ^ K-K+7r+) 
where the limits are at the 90% CL. We also determine effective system- 
atic uncertainties for our measurements by calculating the uncertainty nec- 
essary, when added in quadrature to the statistical uncertainty, to cover 
the central 68% of the distrubution in Eq. 4. By this method, we find 
BR(A+ ^ pK+7r")/BR(A+ ^ pK"7r+) = (0.05 ± 0.26 ± 0.02)% and 
BR(D+ ^ K+K+7r-)/BR(D+ ^ K-K+Tr+) = (0.52 ± 0.17 ± 0.11)% where 
the first errors are statistical and the second are systematic. The distributions 
described by Eq. 4 and the 90% integrals for the DCS decays are shown in 
Fig. 4. Both limits are larger than the expected (< tan"^ 6c) level, but are the 
first reported limits on these decays. Furthermore, this is the first successful 
application of the GP technique to an HEP data analysis. 
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Fig. 4. Relative BR limit determination for — > pK^n' (left) and 
D'^ — > K^K~^ 7r~ (right). The curves show the branching ratio probability for DCS 
decays relative to CF decays. The vertical axes are arbitrary. The shaded areas show 
the 90% integrals over the physical range. 

Acknowledgments 

We wish to acknowledge the assistance of the staffs of Fermi National Ac- 
celerator Laboratory, the INFN of Italy, and the physics departments of the 
collaborating institutions. This research was supported in part by the U. S. 
National Science Foundation, the U. S. Department of Energy, the Italian Isti- 
tuto Nazionale di Fisica Nucleare and Ministero dellTstruzione deU'Universita 
e della Ricerca, the Brazilian Conselho Nacional de Desenvolvimento Cientifico 
e Tecnologico, CONACyT-Mexico, the Korean Ministry of Education, and the 
Korean Science and Engineering Foundation. 



References 



[1] H. J. Lipkin, Puzzles in hyperon, charm and beauty physics, Nucl. Phys. Proc. 
Suppl. 115 (2003) 117-121. 

[2] J. R. Koza, Genetic Programming: On the Programming of Computers by 
Means of Natural Selection, The MIT Press, Cambridge, Massachusetts, 1992. 

[3] J. M. Link, et al.. Application of genetic programming to high energy physics 
event selection hep-ex/0503007, to be published in Nucl. lustrum. Meth. 

[4] P. L. Frabetti, et al.. Description and performance of the Fermilab E687 
spectrometer, Nucl. lustrum. Meth. A320 (1992) 519-547. 

[5] P. L. Frabetti, et al., A Wideband photon beam at the Fermilab Tevatron to 
study heavy flavors, Nucl. lustrum. Methods A329 (1993) 62. 

[6] J. M. Link, et al.. The target silicon detector for the FOCUS spectrometer, 
Nucl. lustrum. Meth. A516 (2003) 364-376. 



9 



[7] J. M. Link, et al., Reconstruction of vccs, kinks, E 's, and 17 's in the FOCUS 
spectrometer, Nucl. Instrum. Meth. A484 (2002) 174-193. 

[8] J. M. Link, et al., Ccrenkov particle identification in FOCUS, Nucl. Instrum. 
Meth. A484 (2002) 270-286. 

[9] S. Eidelman, et al.. Review of particle physics, Phys. Lett. B592 (2004) 1. 

[10] M. R. Convery, Incorporating multiplicative systematic errors in branching ratio 
limits SLAC-TN-03-001. 

[11] S. Malvezzi, D-meson Dalitz fit from Focus, AIP Conf. Proc. 549 (2000) 569- 
574. 



10 



